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Abstract 



O 
O 

^ ■ This paper considers the problem of secret communication over a two-receiver multiple-input multiple-output 



(MIMO) Gaussian broadcast channel. The transmitter has two independent messages, each of which is intended for 



. one of the receivers but needs to be kept asymptotically perfectly secret from the other. It is shown that, surprisingly, 

under a matrix power constraint both messages can be simultaneously transmitted at their respective maximal secrecy 
rates. To prove this result, the MIMO Gaussian wiretap channel is revisited and a new characterization of its secrecy 
capacity is provided via a new coding scheme that uses artificial noise and random binning. 



Index Terms 
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\Q ' Artificial noise, broadcast channel, channel enhancement, information-theoretic security, multiple-input multiple- 

00 ' 

. output (MIMO) communications, wiretap channel 

cn 

O | I. Introduction 

On 
O ■ 

Rapid advances in wireless technology are quickly moving us toward a pervasively connected world in which a 
^ | vast array of wireless devices, from iPhones to biosensors, seamlessly communicate with one another. The openness 
& ' of the wireless medium makes wireless transmission especially susceptible to eavesdropping. Hence, security and 
privacy issues have become increasingly critical for wireless networks. Although wireless technologies are becoming 
more and more secure, eavesdroppers are also becoming smarter. Sole reliance on cryptographic keys in large 
distributed networks where terminals can be compromised is no longer sustainable from the security perspective. 
Furthermore, in wireless networks, secure initial key distribution is difficult and, in fact, can be performed in perfect 
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Fig. 1. MIMO Gaussian broadcast channel with confidential messages. 



secrecy only via physical layer techniques. Therefore, tackling security at the very basic physical layer is of critical 
importance. 

In this paper, we study the problem of secret communication over the multiple-input multiple-output (MIMO) 
Gaussian broadcast channel with two receivers. The transmitter is equipped with t transmit antennas, and receiver 
k, k = 1, 2, is equipped with r*. receive antennas. A discrete-time sample of the channel can be written as 



Y k [m] = H fc X[m] + Z fc [m], k = 1, 2 



(1) 



where is the (real) channel matrix of size r k x t, and {Zfc[m]} m is an independent and identically distributed 
(i.i.d.) additive vector Gaussian noise process with zero mean and identity covariance matrix. The channel input 
{X[m]} m is subject to the matrix power constraint: 



1 n 

- £ (X[m]X T [m]) 



-< s 



(2) 



m = i 



where S is a positive semidefinite matrix, and denotes "less than or equal to" in the positive semidefmite 
ordering between real symmetric matrices. Note that (f2]) is a rather general power constraint that subsumes many 
other important power constraints including the average total and per-antenna power constraints as special cases. 

Consider the communication scenario in which there are two independent messages W\ and Wi at the transmitter. 
Message W\ is intended for receiver 1 but needs to be kept secret from receiver 2, and message W% is intended for 
receiver 1 but needs to be kept secret from receiver 2. (See Fig. [TJfor an illustration of this communication scenario.) 
The confidentiality of the messages at the unintended receivers is measured using the normalized information- 
theoretic quantities [1], [2]: 



-I(Wx;Yl 
n 



and -I(W 2 ;Y\ l ) -> 

n 



3 



where Y£ := (Y^[l], . . . , Y^[n]), and the limits are taken as the block length n — ► oo. The goal is to characterize the 
entire secrecy rate region C s (Hi, H2, S) = {{R\, -R2)} that can be achieved by any coding scheme. C s (Hi, H2, S) 
is usually known as the secrecy capacity region of the channel. 

In recent years, information-theoretic study of secret MIMO communication has been an active area of research. 
(See [3] for a recent survey of progress in this area.) Most noticeably, the secrecy capacity of the MIMO Gaussian 
wiretap channel was characterized in [4]-[6] for the multiple-input single-output (MISO) case and [7]-[10] for 
the general MIMO case. The secrecy capacity region of the MIMO Gaussian broadcast channel with a common 
and a confidential messages was characterized in [11]. The problem of communicating two confidential messages 
over the two-receiver MIMO Gaussian broadcast channel was first considered in [12], where it was shown that 
under the average total power constraint, secret dirty -paper coding (S-DPC) based on double binning [13] achieves 
the secrecy capacity region for the MISO case. For the general MIMO case, however, characterizing the secrecy 
capacity region remained as an open problem. 

The main result of this paper is a precise characterization of the secrecy capacity region of the (general) MIMO 
Gaussian broadcast channel, summarized in the following theorem. 

Theorem 1: The secrecy capacity region C s (Hi,H2,S) of the MIMO Gaussian broadcast channel (Q]) with 
confidential messages W\ (intended for receiver 1 but needing to be kept secret from receiver 2) and W2 (intended 
for receiver 2 but needing to be kept secret from receiver 1) under the matrix power constraint (fSJ) is given by the 
set of nonnegative rate pairs R2) such that 



where \ Tk denotes the identity matrix of size x r^. 

Remark 1: Note that the rate region © is rectangular. This implies that under the matrix power constraint, 
both confidential messages W\ and W2 can be simultaneously transmitted at their respective maximal secrecy rates 
(as if over two separate MIMO Gaussian wiretap channels). The secrecy capacity of the MIMO Gaussian wiretap 
channel under the matrix power constraint was characterized in [9], by which the rate region ([3]) can be rewritten 
as the set of nonnegative rate pairs (R\, R2) such that 



and 




(3) 



and 




(4) 
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Remark 2: Also note that if B* is an optimal solution to the optimization program: 

max (log |I ri + HiBHj| - log |I r2 +H 2 BHj[), (5) 

then B* simultaneously maximizes both objective functions on the right-hand side (RHS) of ([3]). On the other hand, 
the optimization programs on the RHS of © do not, in general, admit the same optimal solution. As we will see, 
this makes © a better choice when it comes to proving the achievability part of the theorem. 

It is rather surprising to see that under the matrix power constraint, both confidential messages W\ and W2 
can be simultaneously transmitted at their respective maximal secrecy rates over the MIMO Gaussian broadcast 
channel (Q]). As we will see, this is due to the fact that there are in fact two different coding schemes: one uses 
only random binning, and the other uses both random binning and artificial noise. Both of them can achieve the 
secrecy capacity of the MIMO Gaussian wiretap channel. Through S-DPC (double binning) [13], both schemes 
can be simultaneously implemented in communicating confidential messages W\ and W2 over the MIMO Gaussian 
broadcast channel (Q]). 

As a corollary, we have the following characterization of the secrecy capacity region under the average total 
power constraint. The result is a simple consequence of [14, Lemma 1]. 

Corollary 1: The secrecy capacity region C s (Hi,H 2 ,-P) of the MIMO Gaussian broadcast channel (fl]) with 
confidential messages W\ (intended for receiver 1 but needing to be kept secret from receiver 2) and W2 (intended 
for receiver 2 but needing to be kept secret from receiver 1) under the average total power constraint: 



n 

m=l 



<P (6) 



is given by the set of nonnegative rate pairs {R\,R2) such that 

#1 < i log |I ri + H1B1HJI - i log |I r2 + H2B1H2I 



and i?2 < r 



I r2 +H 2 (B 1 +B 2 )H| 



^log 



I ri +H 1 (B 1 +B 2 )H[ 



I ri + HiBiH[ 



(7) 



I ra + H 2 B!HT 

for some positive semidefmite matrices Bi and B 2 such that Tr(Bi + B 2 ) < P. 

Remark 3: Unlike Theorem [T] under the average total power constraint, the secrecy capacity region of the MIMO 
Gaussian broadcast channel is, in general, not rectangular. 

The rest of the paper is devoted to the proof of Theorem Q] As mentioned previously, the rectangular nature of 
the rate region ([3]) suggests that the result is intimately connected to the secrecy capacity of the MIMO Gaussian 
wiretap channel. The secrecy capacity of the MIMO Gaussian wiretap channel under the matrix power constraint 
was previously characterized in [9], where it was shown that Gaussian random binning without prefix coding is 
optimal. In Section [TTJ we revisit the MIMO Gaussian wiretap channel problem and show that Gaussian random 
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Fig. 2. MIMO Gaussian wiretap channel. 

binning with prefix coding can also achieve the secrecy capacity, provided that the prefix channel is appropriately 
chosen. In Section [Till we prove Theorem [TJ using two different characterizations of the secrecy capacity of the 
MIMO Gaussian wiretap channel and S-DPC (double binning) [13]. Numerical examples are provided in Section HVl 
to illustrate the theoretical results. Finally, in Section [V] we conclude the paper with some remarks. 



In this section, we revisit the problem of the MIMO Gaussian wiretap channel under a matrix power constraint. 
The problem was first considered in [9], where a precise characterization of the secrecy capacity was provided. 
The goal of this section is to provide an alternative characterization of the secrecy capacity which will facilitate 
proving Theorem [TJ More specifically, we wish to provide a MIMO wiretap channel bound on the secrecy rate R2 
which will match the RHS of ©. 

For that purpose, consider again the MIMO Gaussian broadcast channel ([TJ) but this time with only one confidential 
message W at the transmitter. Message W is intended for receiver 2 (the legitimate receiver) but needs to be kept 
secret from receiver 1 (the eavesdropper). The confidentiality of W at receiver 1 is measured using the normalized 
information-theoretic quantity [1], [2]: 



The channel input {X[m]} m is subject to the matrix power constraint (|2]). The goal is to characterize the secrecy 

capacity C S (H2, Hi, sj^, which is the maximum achievable secrecy rate for message W. This communication 

scenario, as illustrated in Fig. [2] is widely known as the MIMO Gaussian wiretap channel [4]-[9]. 

'in our notation, the first argument in C s (-) represents the channel matrix for the legitimate receiver, and the second argument represents 
the channel matrix for the eavesdropper. 



II. MIMO Gaussian Wiretap Channel Revisited 
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In their seminal work [2], Csiszar and Korner provided a single-letter characterization of the secrecy capacity: 

C a (H 2 ,Hi,S) = max [/([/; Y 2 )-/([/; YO] (8) 
(U,X) 

where U is an auxiliary variable, and the maximization is over all jointly distributed (£7, X) such that U — > X — > 
(Yi,Y2) forms a Markov chain and E[XX T ] H S. Here, I(U, Y&) denotes the mutual information between U 
and Yfc. As shown in [2], the secrecy rate on the RHS of ([8]) can be achieved by a coding scheme that combines 
random binning and prefix coding [2]. More specifically, the auxiliary variable U represents a precoding signal, 
and the conditional distribution of X given U represents the prefix channel. In [9], Liu and Shamai further studied 
the optimization problem on the RHS of ([8]) and showed that a Gaussian U = X is an optimal solution. Hence, a 
matrix characterization of the secrecy capacity is given by [9] 



C S (H 2 , Hi, S) = max (\ log |I r2 + H 2 BHT| - \ log |I ri + HiBHjA . 



(9) 



We may conclude that Gaussian random binning without prefix coding is an optimal coding strategy for the MIMO 
Gaussian wiretap channel. 

Next, we show that a different coding scheme that combines Gaussian random binning and prefix coding can 
also achieve the secrecy capacity of the MIMO Gaussian wiretap channel. This leads to a new characterization of 
the secrecy capacity, summarized in the following theorem. 

Theorem 2: The secrecy capacity C S (H 2 , Hi, S) of the MIMO Gaussian broadcast channel £[]) with a confidential 
message W (intended for receiver 2 but needing to be kept secret from receiver 1) under the matrix power constraint 
(fSJ) is given by: 



C s (H 2 ,Hi,S)= o max s Qlog 



I r2 + H 2 SHT 



I r2 + H 2 BHJ 



I^+HiSHj 



I ri + HiBH[ 



(10) 



Remark 4: The achievability of the secrecy rate on the RHS of (fTOl can be obtained from the Csiszar-Korner 
expression ([8]) by choosing X = U + V, where U and V are two independent Gaussian vectors with zero means 
and covariance matrices S — B and B, respectively. This choice of (U, X) differs from that for (O in two important 
ways: 

1) In (fTOl . the input vector X always has a full covariance matrix S. For (O, the covariance matrix of X needs 
to be chosen to solve an optimization program; the full covariance matrix S is not always an optimal solution. 

2) In (fTOl) . the conditional distribution of X given U may form a nontrivial prefix channel. For (O, U = X so 
prefix coding is never applied. 

Remark 5: Note that the prefix channel in (flOl is an additive vector Gaussian noise channel, so the auxiliary 
variable V represents an artificial noise [15] sent (on purpose) by the transmitter to confuse the eavesdropper. 



Since the artificial noise has no structure to it, it will add to the noise floor at both legitimate receiver and the 
eavesdropper. 

The converse part of the theorem can be proved using a channel-enhancement argument, similar to that in [9]. 
The details of the proof are provided in Appendix U 

III. MIMO Gaussian Broadcast Channel with Confidential Messages 

In this section, we prove Theorem [T] To prove the converse part of the theorem, we will consider a single- 
message, wiretap channel bound on the secrecy rates R\ and i? 2 . More specifically, note that both messages W\ 
and W2 can be transmitted at the maximum secrecy rate when the other message is absent from the transmission. 
Therefore, to bound from above the secrecy rate we assume that only W\ needs to be communicated over the 
channel. This is precisely a MIMO Gaussian wiretap channel problem with receiver 1 as legitimate receiver and 
receiver 2 as eavesdropper. Reversing the roles of receiver 1 and 2, we have from Q that 



Ri < Cs(Hi, H2, S) 



max 
0^B-<S 



-log|I ri +HiBHf| 



- log \I r2 + H 2 BH 2 T 



(11) 



Similarly, to bound from above the secrecy rate R2, let us assume that only W2 needs to be communicated over the 
channel. This is, again, a MIMO Gaussian wiretap channel problem with receiver 2 playing the role of legitimate 
receiver and receiver 1 playing the role of eavesdropper. By Theorem |2j 



R2 < C s (H2, Hi, S) 

= max ( — log 
o^B^s \ 2 6 



I r , + H 2 SH^ 



Ir 2 + H 2 BH| 



log 



I ri + HiBH[ 



(12) 



Putting together (fTTb and (fT2l . we have proved the converse part of the theorem. 

Next, we show that every rate pair (i?i,it! 2 ) within the secrecy rate region Q is achievable. Note that ([3]) is 
rectangular, so we only need to show that the corner point (i?i,i? 2 ) given by 



and 



#1 = max (\ log |I ri + HiBH^I - \ log |I r2 + H 2 BHT| ) 

O^B^S V z Z J 

I ri + HxSHj 



i? 2 = max ( — log 
0^B-<S\2 



I r2 + H 2 SHT 



- log 

2 8 



I ri + HiBHj 



(13) 



I r2 + H 2 BH2 
is achievable. 

Recall from [13] that for any jointly distributed (Vi, V 2 , X) such that (Vi, V 2 ) X -> (Yi, Y 2 ) forms a Markov 



s 



chain and E[XX T ] H S, the secrecy rate pair (Ri,R 2 ) given by 

Rx=I(y l ;Y 1 )-I{V l ;V 2 ,Y 2 ) 
and R 2 = I(V 2 ;Y 2 ) - I(y 2 ;Vi, Yi) (14) 

is achievable for the MIMO Gaussian broadcast channel dH) under the matrix power constraint (|2]). In [13], the 
achievability of the rate pair (fT4l was proved using a double-binning scheme. Specifically, the auxiliary variables 
V\ and V 2 represent the precoding signals for the confidential messages W\ and W 2 , respectively. 
Now let B be a positive semidefmite matrix such that B < S, and let 

V 1 = \J X + FU 2 

v 2 = u 2 

and X = Ui + U 2 (15) 

where Ui and U 2 are two independent Gaussian vectors with zero means and covariance matrices B and S — B, 
respectively, and 

F := BH[(I ri + HiBH[) _1 Hi. (16) 

By CB]>, 

Y fe = H fc (U!+U 2 ) + Z fc 

for k = 1,2. Note that the matrix F defined in ( fT6l ) is precisely the precoding matrix for suppressing U 2 from Yi 
[16, Theorem 1]. Hence, 

I(y 1] Y 1 )-I{V 1] V 2 )=I{V 1 ;Y x )-I{V 1 ;\J 2 ) 

= ilog|I ri +HiBHl|. (17) 

Moreover, 

/(y i; Y 2 |F 2 ) = I(U 1 +FU 2 ;H 2 (U 1 +U 2 ) + Z 2 |U 2 ) 
= /(U 1 ;H 2 U 1 + Z 2 |U 2 ) 
= /(U 1 ;H 2 U 1 + Z 2 ) 

= ilog|I r2 + H 2 BHT| (18) 
where the third equality follows from the fact that Ui and U 2 are independent. Putting together (fTTT ) and (fT8l) . we 
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have 



/(Fi;Yi) - I{V X ;V 2 ,Y 2 ) = [I{V X ; Y x ) - I(V X ;V 2 )} - I{V X ; Y 2 \V 2 ) 

= I log |I ri + HiBH^I - i log \I r2 + H 2 BHj| 



Similarly, 



I(V X , V 2 ; Yi) = J(Ui + FU 2 , U 2 ; Hi(Ui + U 2 ) + Z 2 ) 
= /(Ui,U 2 ;H 1 (U 1 +U 2 ) + Z 2 ) 
= ilog|I ri +H 1 SH[|. 



(19) 



(20) 



Thus, 



/( ;Vi,Yi) = I(V 2 ;Y X \V X ) + I(V 2 ;V X ) 



I<y x ,V 2 ;Y x ) - [I(V X ;Y X ) - I(V X ;V 2 )} 



- log 

2 B 



I ri + HxSHj 



I ri + HiBH^ 

where the last equality follows from (fTTT ) and (l20l) . Moreover, 

J(y 2 ;Y 2 ) =/(U 2 ;H 2 (U 1 +U 2 ) + Z 2 ) 



1 



log 



+ n 2 sn] 



I r2 + H 2 BH2 



Putting together (1211 ) and (1221 . we have 

I{V 2 ;Y 2 ) - I{V 2 ;V X ,Y X ) 



1 



log 



L '2 



H 2 SHJ 



i r2 + h 2 bhj 



1 



log 



I ri + HiSHj 



I ri + HiBH[ 



(21) 



(22) 



(23) 



l 2 ±JAX 2 

Finally, let B be an optimal solution to the optimization program (f5]). As mentioned previously in Remark[2l such 
a choice will simultaneously maximize the RHS of ( fT9l ) and ( |23l . Thus, the corner point ( fT3l is indeed achievable. 
This completes the proof of the theorem. 

Remark 6: Note that in standard dirty-paper coding (DPC), the precoding matrix F is chosen to cancel the known 
interference. In our scheme, such a choice plays two important roles. First, it helps to cancel the precoding signal 
representing message W 2 , so message W x sees an interference-free legitimate receiver channel. Second, it helps to 
boost the security for message W 2 by causing interference to its eavesdropper. For this reason, we call our scheme 
S-DPC, to differentiate from the standard DPC. 

Remark 7: In S-DPC, both the legitimate receiver and the eavesdropper for message W x are interference free. 
On the other hand, for message W 2 , both the legitimate receiver and the eavesdropper are subject to interference 



10 

from the precoding signal representing message W\. As we have seen in Section [TTJ the secrecy capacity of the 
MIMO Gaussian wiretap channel can be achieved with or without interference in place. Therefore, both secrecy 
capacity achieving schemes can be simultaneously implemented via S-DPC to simultaneously communicate both 
confidential messages at their respective maximal secrecy rates. 

IV. Numerical Examples 

In this section, we provide numerical examples to illustrate the secrecy capacity region of the MIMO Gaussian 
wiretap channel with confidential messages. As shown in © and (0, under both matrix and average total power 
constraints, the secrecy capacity regions C s (Hi,H2,S) and C s (Hi, H2, -P) are expressed in terms of matrix 
optimization programs (though implicit in (O). In general, these optimization programs are not convex, and hence, 
finding the boundary of the secrecy capacity regions is nontrivial. 

In [12], a precise characterization of the secrecy capacity region C s (Hi,H2, P) was obtained for the MISO 
Gaussian broadcast channel using the generalized eigenvalue decomposition [17, Ch. 6.3]. For the aligned MIMO 
Gaussian wiretap channel, [10] provided an explicit, closed-form expression for the secrecy capacity. In the 
following, we generalize the results of [10] and [12] to the general MIMO Gaussian broadcast channel under 
the matrix power constraint. 

Let <f>j, j = 1, . . . , t, be the generalized eigenvalues of the pencil 

(it + SiHTHxSi, I t + S5HTH 2 Si) . (24) 

Since both I t + S2HJH4S2 and I t + S2H2H2S2 are strictly positive definite, we have <frj > for j = 1, . . . , t. 
Without loss of generality, we may assume that these generalized eigenvalues are ordered as 

4>i > ■ ■ ■ > <t> P > 1 > 4> P +i >■■■>&> 0, 

i.e., a total of p of them are assumed to be greater than 1. We have the following characterization of the secrecy 
capacity of the MIMO Gaussian wiretap channel under the matrix power constraint, which is a natural extension 
of [10]. 

Theorem 3: The secrecy capacity C s (Hi, H2, S) of the MIMO Gaussian broadcast channel (HJ with confidential 
message W (intended for receiver 1 but needing to be kept secret from receiver 2) under the matrix power constraint 
(121) is given by 

1 p 

C r «(Hi,H 2j S) = -^log0 j (25) 

i=i 

where fy, j = 1, . . . , p, are the generalized eigenvalues of the pencil (l24l that are greater than 1. 



1 1 

Remark 8: Note that It + S2H2H2S2 is invertible, so computing the generalized eigenvalues of the pencil (1241 
can be reduced to the problem of finding standard eigenvalues of a related semidefinite matrix [17, Ch. 6.3]. Hence, 
the secrecy capacity expression d25l ) is computable. 

A proof of the theorem following the approach of [10] is provided in Appendix ITT] As a corollary, we have the 
following characterization of the secrecy capacity region of the MIMO Gaussian broadcast channel with confidential 
messages under the matrix power constraint. 

Corollary 2: The secrecy capacity region C s (Hi,H2, S) of the MIMO Gaussian broadcast channel £T|) with 
confidential messages W\ (intended for receiver 1 but needing to be kept secret from receiver 2) and W2 (intended 
for receiver 2 but needing to be kept secret from receiver 1) under the matrix constraint (O is given by the set of 
nonnegative rate pairs (i?x,i?2) such that 

1 9 

i?i<-J>g<^ 



and 



j=p+l T 3 



where fy, j = 1, . . . , p, are the generalized eigenvalues of the pencil (1241) that are greater than 1, and <pj, j = 
p + 1, . . . , t, are the generalized eigenvalues of the pencil (T24l that are less than or equal to 1. 
Proof: By Theorem Q] we only need to show that the secrecy capacity 

1 * 1 
C a (H 2 ,Hi,S) = - lo S^T- 

Consider the pencil 

(it + siH|H 2 S2, I t + S"HjHiS») . (27) 

Note that the pencils (l24l and (|27T ) are generated by the same pair of semidefmite matrices but with different order. 
Therefore, the generalized eigenvalues of the pencil (|27T ) are given by 

111 1 

o< — <•••< — <i< — <•••<-• 

4>i <Pp 4>p+i <Pt 

Applying Theorem [3] for C s (H2,Hi,S) completes the proof of the corollary. ■ 
Under the average total power constraint, we have not been able to find a computable secrecy capacity expression 
for the general MIMO case. We can, however, write [14, Lemma 1] 

C S (H!,H2,P)= |J C a (Hi,H 2 ,S). 

S^O, Tr(S)<P 
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Fig. 3. Secrecy rate regions of the MIMO Gaussian broadcast channel under the average total power constraint. 

For any given semidefinite S, C s (Hi,H2, S) can be computed as given by (|26l ). Then, the secrecy capacity region 
C s (Hi, H2, P) can be found through an exhaustive search over the set {S : S y and Tr(S) < P}. 
Let hn = (0.3 2.5), h 12 = (2.2 1.8), h 2 i = (1.3 1.2), h 22 = (1.5 3.9) and P = 12, and let 



Hz 



ifci 



, k = l,2. 



h 



A- 2 



The secrecy capacity regions C s (hn, I122, i 3 ), C s (Hi, h22, -P), C s (hn, H2, P) and C s (Hi,H 2 ,P) are illustrated 
in Fig. [3] For comparison, we have also plotted the secrecy rate regions achieved by the simple zero-forcing 
(ZF) strategy. In ZF, each of the confidential messages is encoded using a vector Gaussian signal. To guarantee 
confidentiality, the covariance matrices of the transmit signals are chosen in the null space of the channel matrix 
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Fig. 4. Rate regions of the MIMO Gaussian broadcast channel under the power matrix constraint. 



at the unintended receiver. Hence, the achievable secrecy rate region is given by 



KffHt.H^P) 



U 



(Ri,R 2 



BibO, B 2 ^0, Tr(Bi+B 2 )<P 
H 2 B 1= 0, H!B 2 =0 



Ri < glogllr, + HiBiHjl 
R2 < §log|I r2 +H 2 B 2 H|| 



(28) 



Note that unlike the secrecy capacity region expression (|7]), computing the rate region d28l ) only involves solving 
convex optimization programs. As shown in Fig. |3j in all four scenarios, ZF is strictly suboptimal as compared 
with S-DPC. In particular, if the channel matrix of the unintended receiver has full row rank, ZF cannot achieve 
any positive secrecy rate for the corresponding confidential message. On the other hand, S-DPC can always achieve 
positive secrecy rates for both confidential messages unless the MIMO Gaussian broadcast channel is degraded. 
Finally, let 



Hi 



H 2 




and 



5.0 -0.7 -2.0 
-0.7 3.8 -2.5 
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Fig. S] illustrates the secrecy capacity region C s (Hi,H2,S) of the MIMO Gaussian broadcast channel (Q~|) under 
the matrix power constraint Q. Here, the secrecy capacity region C s (Hi,H2,S) is plotted based on the com- 
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putable expression (|26l ). Also in the figure are the secrecy rate region 1Z^ f (Hi, H 2 , S) achieved by ZF strategy 
and the nonsecrecy capacity region 7v!. DPC (Hi, H2, S) achieved by standard DPC [14]. As expected, we have 
(Hi,H 2 ,S) cC s (Hi,H 2 ,S) Cft DPC (Hi,H 2 ,S). 

V. Concluding Remarks 

In this paper, we have considered the problem of communicating two confidential messages over the two-receiver 
MIMO Gaussian broadcast channel. Each of the confidential messages is intended for one of the receivers but needs 
to be kept asymptotically perfectly secret from the other. Precise characterizations of the secrecy capacity region 
have been provided under both matrix and average total power constraints. Surprisingly, under the matrix power 
constraint, both confidential messages can be transmitted simultaneously at their respective maximal secrecy rates. 

To prove this result, we have revisited the problem of the MIMO Gaussian wiretap channel and proposed a new 
coding scheme that achieves the secrecy capacity of the channel. Unlike the previous scheme considered in [4]-[9] 
where prefix coding is not applied, the new coding scheme uses artificial vector Gaussian noise as a way of prefix 
coding. Moreover, the optimal covariance matrix of the artificial noise coincides with that of the transmit signal 
in the previous scheme. This allows both schemes to be overlayed via S-DPC without sacrificing the secrecy rate 
performance for either of them. We believe that the new understanding of the MIMO Gaussian wiretap channel 
problem gained in this work will help to solve some other multiuser secret communication problems. 

Appendix I 
Proof of Theorem [2] 

In this appendix, we prove Theorem |2] As mentioned previously in Remark SI the secrecy rate on the RHS of 
([Tol l can be achieved by a coding scheme that combines Gaussian random binning and prefix coding. We therefore 
concentrate on the converse part of the theorem. 

Following [9], we will first prove the converse result for the special case where the channel matrices Hi and H 2 
are square and invertible. Next, we will broaden the result to the general case by approximating arbitrary channel 
matrices Hi and H 2 by square and invertible ones. For brevity, we will term the special case as the aligned MIMO 
Gaussian wiretap channel and the general case as the general MIMO Gaussian wiretap channel. 

A. Aligned MIMO Gaussian Wiretap Channel 

Consider the special case of the MIMO Gaussian broadcast channel dH) where the channel matrices Hi and H 2 
are square and invertible. Multiplying both sides of £T|) by H^ 1 , the channel model can be equivalently written as 

Y k [m] = X[m] + Z k [m], k = 1,2 (29) 
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where {Zfc[m]} m is an i.i.d. additive vector Gaussian noise process with zero mean and covariance matrix 



(30) 



Denote by C S (N 2 , Ni, S) the secrecy capacity of (l29l (viewed as a MIMO Gaussian wiretap channel with receiver 2 
as legitimate receiver and receiver 1 as eavesdropper) under the matrix power constraint (fSJ). We have the following 
characterization of C s (N2,Ni,S). 
Lemma 1: The secrecy capacity 



C s (N 2) Ni,S) = max -log 
O^B^S V 2 



S + N 2 



B + N 2 



1 



log 



S + N x 



B + N x 



(31) 



Proof: The achievability of the secrecy rate on the RHS of (1311 ) follows from the achievability of the secrecy 
rate on the RHS of ( fTOl for the general case and the definition of in (|30t - To prove the converse result, we 
will follow [9] and consider a channel-enhancement argument as follows. 

Let us first assume that S y 0. In this case, let B* be an optimal solution to the optimization program on the 
RHS of (l3Tb . Then, B* must satisfy the following Karush-Kuhn-Tucker conditions [9]: 

(B* + Ni)- 1 + Mi = (B* + N 2 )" 1 +M 2 (32a) 
B*Mi = (32b) 
and (S - B*)M 2 = (32c) 

where Mi and M 2 are positive semidefmite matrices. Let Ni be a real symmetric matrix such that 

(B* + NO" 1 = (B* + NO" 1 + Ml 

From Eqns. (23), (25), (31) and (34) of [9], we have 

0^N a r< {Ni,N 2 }, 



(33) 



(34) 



and 



B* + Ni 




B* + Ni 


Ni 




Ni 


S + Ni 




S + N 2 


B* + Ni 




B* + N 2 



(35) 



(36) 
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Now consider an enhanced MIMO Gaussian broadcast channel: 



Yi[m] = X[m] + Zi[m] 
and Y 2 [m] = X[m] + Zi[m] 



(37) 



where {Zi[m]} m and {Zi[m]} m are i.i.d. additive vector Gaussian noise processes with zero means and covariance 
matrices Ni and Ni, respectively. Denote by C s (Ni,Ni,S) the secrecy capacity of 071 ) (viewed as a MIMO 
Gaussian wiretap channel with receiver 2 as legitimate receiver and receiver 1 as eavesdropper) under the matrix 
constraint Note from ( f34T > that Ni ^ Ni, so the enhanced MIMO Gaussian wiretap channel (|37l ) is degraded. 
Hence, 

C,(Ni,Ni,S) = 




log 



B* + N 2 
S + N 2 



S + Ni 
B* + Ni 



S + Nx 



B* + N 2 



log 



B* + N! 



(38) 



where the first equality follows from [9, Theorem 1]; the third equality follows from (1351 ): and the fifth equality 
follows from (l36l ). 

Finally, note from (l34l) that Ni < N 2 , i.e., the legitimate receiver in the enhanced wiretap channel (l37l) receives 
a better signal that the legitimate receiver in the original wiretap channel d29l ). Therefore, 



1 



S + N 2 


— - log 


S + N x 


B* + N 2 


2 5 


B* + Ni 



log 

z 

where the last equality follows from d38l ). This proved the desired converse result for S y 0. 
For the case when S >z 0, |S| = 0, let 

9 = Rank(S) < t. 



Following the same footsteps as in the proof of [14, Lemma 2], we can define an equivalent aligned MIMO Gaussian 
wiretap channel with 9 transmit and receive antennas and a new covariance matrix power constraint that is strictly 



17 

positive definite. Hence, we can convert the case when S y 0, |S| = to the case when S >- with the same 
secrecy capacity. This argument can be formally described as follows. 
Since S is positive semidefinite, we can write 

S = QsAsQg 

where Qs is an orthogonal matrix and 

A s = Diag(0, ... ,0,si,... ,s ) 
t- 

is diagonal with Sj > 0, j = 1, . . . , 6. For k = 1,2, write 

Q s N fc Qs = 



where Cfc, and are (sub)matrices of size (t — 9) x (t — #), (t — 0) x 9 and 9x9, respectively. Let 

0/ 



>(t-e)xt 



k = 1,2. 



We now define an intermediate and equivalent channel by multiplying both sides of (1291 ) by an invertible matrix 
A fc Q s : 

Y^rn] = X'[m] + Z' fc [m], fc = 1, 2 (39) 

where 

Y' k [m]=A k QlY k [m] 
X'[m] = A fc Q s X[m] 
and Z' fe [m] = AfeQ^Zfefm]. 

Then, the covariance matrix N' fc of the additive Gaussian noise vector Zjjm] is given by 

K = I- (40) 

V° Efc-DJC^D*; 

and the matrix power constraint © becomes 

1 n 

- ^ X'[m]X /T [m] r< S' (41) 

77* 
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where 

S' = AfcQ^SQsAT 
= A k A s A T k 

= As- (42) 

Note from (l42l that S' is diagonal with first i — 9 diagonal elements equal to zero. Thus, the matrix constraint 
(|4T|) requires that the first t — 9 elements of X'[m] be zero. Moreover, from (l40b . the first t — 9 and the rest of 
9 elements of Zjjm] are uncorrelated and hence must be independent as Zjjm] is Gaussian. Therefore, only the 
latter 9 antennas transmit/receive information regarding message W. This allows us to define another equivalent 
aligned MIMO Gaussian broadcast channel with 9 antennas at the transmitter and each of the receivers: 

Yfc[m] = X[m] + ZfcH, k = 1, 2 (43) 

where 

YjtH = AY[H 
X[m] = AX'[m] 
Z fe [m] = AZ' fc [m] 

and A = [0 e x Ig] . Now, the matrix power constraint becomes 

1 n - — 

- X[m]X T [?n] ^ S (44) 

m=l 

where 

S = AS' A T 

= Diag(si,...,s ). (45) 

Note that the matrix power constraint S is strictly positive definite, so we can apply the previous result to the new 
wiretap channel (l43l) . This completes the proof of the lemma. ■ 



B. General MIMO Gaussian Wiretap Channel 

For the general case, we may assume that the channel matrices Hi and H2 are square but not necessarily invertible. 
If that is not the case, we can use singular value decomposition (SVD) to show that there is an equivalent channel 
which does have t x t square channel matrices. That is, we can find a new channel with square channel matrices 
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which are derived from the original ones via matrix multiplications. The new channel is equivalent to the original 
one in preserving the secrecy capacity under the same power constraint. 
Consider using SVD to write the channel matrices as follows: 

H k = U k A k Vl A; =1,2 

where and are t x t orthogonal matrices, and A k is diagonal. We now define a new MIMO Gaussian 
broadcast channel which has invertible channel matrices: 



Y k [m] = H fe X[m] + Z fc [m], k = 1, 2 



(46) 



where 



H* 



V k (A k + aI t )V 



for some a > 0, and {Zfc[m]} m is an i.i.d. additive vector Gaussian noise process with zero mean and identity 
covariance matrix. Note that the channel matrices H^, k = 1,2, are invertible. By Lemma [T] the secrecy capacity 
C s (H 2 ,Hi,S) of d29l) (viewed as a MIMO Gaussian wiretap channel with receiver 2 as legitimate receiver and 
receiver 1 as eavesdropper) under the matrix power constraint © is given by 



C B (H 2 ,Hi,S) = max -log 



It + H2SH0 



It + H2BH2 



Finally, let a J. 0. We have — > H^, k = 1, 2 and hence 



Moreover, by Eqns. (45) and (46) of [9], 



L, + H 2 SH^ 



I r2 + H 2 BHJ 



Ij + HxSHl 



It + HiBH! 



I ri +HiSHf 



I ri + HiBH[ 



<7 S (H 2 , Hi, S) < C a (H 2 ,Hi,S) + 0{a) 



(47) 



where 0(a) — > in the limit as a J. 0. Thus, we have the desired converse result 



-o™s(J log 



L,, + H 2 SH^ 



I ra + H 2 BH T 



1 



log 



I ri + HiSH[ 



12—2 

by letting a { on the RHS of Wf\ . This completes the proof of the theorem 



l ri + HiBH[ 
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Appendix II 
Proof of Theorem [3] 

In this appendix, we prove Theorem [3j Without loss of generality, we may assume that the matrix power constraint 
S is strictly positive definite and the channel matrices Hi and H2 are square but not necessarily invertible. We 
start with the following simple lemma. 

Lemma 2: For any t x t matrices B and H such that B y 0, we have 

I It + HBH T | = \I t + H T HB| . (48) 

In particular, if B = It, we have 

I It + HH T | = \I t + H T H| . (49) 
Proof: Note that if H is invertible, the equalities in (|48l and d49l are trivial. Otherwise, consider using SVD 
to rewrite H as 

H = UAV T 

where U and V are t X t orthogonal matrices, and 

A = Diag(0 1 _^0,Ai,...,A 6 ) 

t-b 

is diagonal with Xj > 0, j = 1, . . . , b. Write 



V T BV = 




where Cb, Db and Eb are (sub)matrices of size (t — b) x (t — b), (t — b) x b and b x b, respectively. Then, 

I It + HBH T | = |I t + UAV T BVAU T | 
= I If + AV T BVA| 

= |I 6 + AE B A| (50) 
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where A = Diag(Ai, . . . , A&). On the other hand, 



|L + H T HB| 



It + VA 2 V T B| 
It + A 2 V T BV| 
I b + A 2 E B 
I 6 + AE B A| 



(51) 



where the last equality follows from the fact that A is invertible. Putting together (l50l and (I5TI ) proves the equality 
in (|48T ). This completes the proof of the lemma. ■ 
We are now ready to prove Theorem following the approach of [10]. Let 



O k :=nlH k k = l,2, 



(52) 



and let $ denote the generalized eigenvalue matrix of the pencil 



such that 



It + S^OiSs, It + S50 2 S" 



3> 



*i 
¥ 2 



where <&i = Diag{0i, . . . , 4> p } and $2 = Diag{0 p +i , . . . , ^> t }. Let G be the corresponding generalized eigenvector 
matrix such that 



G T (It + SaOiS* G = * 



and 



G T (It + S2O9S2 ) G = L. 



(53) 



Now define 



O := S 



I 



t-pj 



(54) 



Since the generalized eigenvalues are ordered as 



> • • • > <t> P > 1 > 0p+i > • • • > <h > 
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we have 



Hence by d33) and (1531) . 



and 



*i 
*i 



I 



t-p. 



O b {Oi,0 2 }. 



(55) 



It follows that 



C,(Hi, H 2 , S) = max [J log \I t + HxBHjl - \ log |I t + B^BH^ 

O^B^S \ A A J 



max — log 
o^B^s\2 



max I — log 
o-<B^s V 2 



< max ( — log 
~ o^B:<s \2 

= max ( — log 
o-<B^s V 2 



I t + Bm[HiB5 



I f + B2HIH2B: 



L + B2O1B: 



L + B^OBi 



1 



log 



OiBO" 



- log 

2 B 

- log 

2 & 



1^1 



I t + OiSO 



- - log 

2 & 

- log 

2 8 



I t + B2O2B5 
It + Bi0 2 B 
It + 0|BO| 

It + ofsof 1 

I t + Si0 2 Si 



- log 

2 & 

1, 

- log 

2 & 



^ lo g|*i| 
1 p 

2Zl lo S ( 

3=1 



(56) 
(57) 
(58) 
(59) 
(60) 
(61) 
(62) 

(63) 



where ([56]>, ([59]) and dgT) follow from (|49]>; ([57]) follows from the definition of Oi in ([52]>; ([58]) follows from the 
fact that Oi H O (see d53V); d60l follows from the fact that 2 ^ O (see (gU); and d62j) follows ^3) and the 
definition of O in (l54l ). 

To prove the reverse inequality, let G = [Gi G2] where Gi and G2 are (sub)matrices of size t x p and t x p, 
respectively, and let 



B* := S2G 



(G^Gx)- 1 0' 



G T S2 . 



(64) 
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Then, B* is positive semidefinite. Moreover, we may verify that B* ■< S as follows. Note that G is invertible, so 
it is enough to show that 



1 (G T G) _1 . 



Note that 



G T G 



GjGi G T G 5 
Gjd G]G 2 



Using block inversion, we may obtain 

_! /(GTG 1 )- 1 + (G[G 1 )- 1 GTG 2 Eg 1 G 2 t G 1 (G[G 1 )- 1 (GlGiJ-^jGaE 



(GTG)' 



-l 

G 



where 



E^GjGiCGjGi)- 1 



E G = GJG 2 - G 2 r Gi(Gj"Gi)~ 1 Gj"Gs 



E 



-i 



Since G T G is positive definite, we have 



E G y 



and hence 



(GTG)" 1 



'(GfGO- 1 o\ _ / (GfGiJ-^lGaE^GjGiCGjGi)- 1 (GjGi^GjGaEG 1 ' 
0/ \ E^GjGiCGfGi)- 1 E^ 1 

'l p (G T G 1 )- 1 GTG 2 \ (o 



Lt-p 



^0 Eq7 VGjdCGlGi)- 1 I 



>- 0. 



By 63, 



C s (H 1)H2 ,S)= o n^ s -log 



1 



1 _ 1 



It + B^OiBi 
I t + B*3QiB 



- log 

2 & 



- log 

2 & 



>ilo 
~ 2 

= i log |I t + B*Oi | - i log |I t + B*Q 2 



It + B^0 2 Bi 
I t + B*3Q 2 B* 



(65) 
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where the last equality follows from ( |48"T ). From d53l ), we have 



Oi = S~2 (G^G 1 - I t ) S-2 
and Q 2 = S~3 (G^G" 1 - I t ) S - 3 . 



(66) 



Hence, 



, / (GjGi) -1 o\ 
B*Oi = S = G V 1 J G T (G _T <&G — It) S" 

V o o, 



SiG 



S5G 



S^G 



(GIG,)" 1 0\ /(GIG,)- 1 



0, 







G^S" 





0^ 


H 


V o 


* 2 y 





(G[Gi) _1 *i - I p - (GjGt)- 1 GjG 2 




0, 

(GIG!)" 1 (A /GjGi G|G 2 

0/ \GjGi GjG 2y 

G -1 S - " 



G _1 S" 



giving 



|I t + B*Oi| = IGlGif 1 |*i| 



Similarly, we may obtain 



B*Oo = SiG 



( G i G i) _1 - Ip 




(GIG 1 )- 1 GJG 2 




G _1 S _ a 



and 



|I t + B*0 2 | = IGjGil" 1 
Substituting (l67l ) and (I68T ) into (I65T ). we may obtain 



C 8 (Hi,H 2l S) > - log |*i | 



i=i 



Putting together (1631 ) and d69l establishes the desired equality 



i p 

C s (Hi,H 2 ,S) = -^log^. 



(67) 



(68) 



(69) 
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This completes the proof of the theorem. 
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